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Abstract 

We characterize the quantum gate fidehty in a state-independent manner by giving an exphcit basis-independent 
expression for its variance which can be extended to calculate all higher order moments. Using these results we obtain 
a simple expression for the variance of a single qubit system and deduce the asymptotic behavior for large-dimensional 
quantum systems. Applications of these results to quantum chaos and randomized benchmarking are discussed. 
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I. INTRODUCTION 

The building blocks of a quantum computation are quantum logic gates, unitary operations that are applied in a 
specific sequence to the physical systems that encode quantum information. In theory, any quantum algorithm can be 
implemented with high precision by applying a correctly chosen sequence of gates - but in practice, gates have errors. 
In real experiments, we attempt to apply an ideal gate U, but what really happens is a noisy quantum operation £. 
If we want to end up with a dynamical evolution close to the desired algorithm, £ had better be "close" to U. 

How close £ is to U, operationally, depends on the state of the system they act on. Some states may evolve 
identically under £ and U, while for other states they produce drastically different outputs! Understanding quantum 
noise and designing error-resistant devices requires state-independent characterizations of the noise. One way to get 
such a quantity, beginning with a state-dependent gate fidelity J^{p), is to average it over all [pure] input states. This 
average gate fidelity, T, provides a concise, useful measure of error. 

However, it provides no information about fluctuations in the gate fidelity - i.e., how the error varies over input 
states. The magnitude of the fluctuations is a useful diagnostic. It provides information about the worst-case fldelity, 
which is relevant for fault-tolerant design. Large fluctuations also suggest that the average error may be dominated 
by a few very error-prone states, in which case addressing those states can produce dramatic improvements in average 
fldelity. Large fluctuations may also indicate hidden high-fldelity information-preserving structures such as pointer 
bases or decoherence-free subspaces 1 1 . 

In this paper, we calculate the variance of the gate fldelity analytically, and discuss how it might be measured in 
experiments (and challenges to doing so!). Moreover, we develop a general method for calculating higher moments of 
the gate fldelity, which can be applied to other purposes. This problem has been considered before: Rcf. ^ solved the 
special case where £ maps pure states into pure states. Our calculation applies to general quantum operations. We 
apply it to a couple of interesting speciflc cases: operations acting on a single qubit, and on very large (dim('H) — > oo) 
systems. 

We begin, in Section [Hi by introducing the framework and formulae that we will use. In Section [HI Al we calculate 
the average gate fidelity as a warm-up, and confirm agreement with previous calculations. We then calculate the 
variance in Section IIII Bl and briefly discuss higher order moments in Section IIVI Finally, we calculate explicit 
expressions for gates on a single qubit and asymptotically large systems (Section |V|. Wc conclude, in Section IVTl 
by discussing applications to randomized noise characterization, and to estimating fldelity decay under controlled 
perturbations of chaotic systems. 
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II. BACKGROUND 

Before beginning, we set some notation for the rest of the paper. H represents a system's Hilbert space, assumed 
in this paper to be of finite dimension d. L{H) is the set of all linear operators on H (i.e., dx d matrices), and 'D{T-L) 
is the set of (mixed) quantum states on "H, containing aU the positive, trace-1 operators in L{'H). Pure states are 
represented by projectors in the complex projective space CP''"^. 



A. Quantum Operations 

Quantum operations - a.k.a. processes or channels - describe the dynamical evolution of quantum systems over 
a period of time. These dynamics may be reversible or irreversible, and they may even involve adding or discarding 
parts of the system, so the initial and final Hilbert spaces need not be identical. Let T-Li and T-L2 be Hilbert spaces 
of dimension di and d2, representing (respectively) the input and the output of a quantum dynamical process. We 
will denote the set of all linear (super)operators from L{Hi) to L{7i2) by T{'Hi,H2), and if Hi = % = H2, denote it 

Tin). 

Not every linear superoperator £" is a valid quantum operation. First, £ must preserve the trace of the input state 
pi E ViHi), for it represents total probability. Furthermore, a superoperator £ that maps some positive semidefinite 
pi > to a non-positive operator is physically impossible - it is not a positive map. In fact, to represent a valid 
operation, a superoperator must satisfy the even stronger condition of complete positivity: given an ancillary system 
represented by a Hilbert space A, £ must map every joint state pui.A G T^{T~ii ®.4) to a positive semidefinite state of 
7^2 ® A. Complete positivity is both necessary and sufficient (together with tracc-prcscrvingncss) for £ to represent 
a valid physical process, so we will restrict our focus to these CPTP linear maps. 

There are various useful ways of representing a CPTP map £ (see Ref. |3| for more details). The most brutally 
straightforward one comes from the observation that L{T-L) is itself a Hilbert space of dimension d^ under the Hilbert- 
Schmidt inner product, 

{A\B) = tr {A'^B) . 

Let {Bi} be any orthogonal basis for L{T-L) with the normalization condition tr (^bJBj^ = d. This choice of normal- 
ization, while nonstandard, is convenient for a variety of reasons - e.g., it allows the basis matrices to be unitary. 
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Using such basis, £ can be written as a x (P matrix with coefficients 

^ {B.\E\B,) J^{BlEiB,)) 

The generahzation to maps with different input and output spaces Hi and ■H2 shouid be obvious. 

Whiie this representation is usefui and straightforward, it can represent any hnear map on L('H). Complete positivity 
is neither mandated nor easily tested in this representation. So we turn to the C'hoi-Jamiolkowski representation. As 
before, let {Bi} be an orthogonal basis for L{H). Now let % be a x matrix of coefficients and define 

£{p)^^X^.JB,pB]. 

Like the standard representation given above, this "x-matrix" representation is capable of representing any linear map 
on L{'H). (This is easily proven by choosing a basis consisting of the normalized matrix units {•\/d|fc)(^| : fc, / = 1 . . . d}.) 
However, £ is completely positive if and only if x is positive semidefinite! This statement is basis-independent; changing 
{Bi} corresponds to a unitary transformation of x- If X is positive semidefinite, then it can be diagonalized, and £ 
can be written as 

£{p)^Y.^^p^l w 

i 

This is the Kraus form, or operator-sum representation, of £. Any linear map of the form given in Eq. ([1]) is completely 
positive (even when the {Ki} are not orthogonal), and every CP map can be written this way. 

To actually construct x from a given £, we consider an augmented Hilbert space H'Si'H. Let |^) be the symmetric 
maximally entangled ("Bell") state, 

1 

I*) = |a), 

^ a— 1 

and define the Jamiolkowski state of 8 as the Choi matrix of S scaled by a factor c?, 

a,b=l 

It is easy to show by direct calculation that the elements of pj, in the standard basis {|fc) (E) \l)}, are identical to the 
elements of the x matrix in the supernormalized basis of matrix units {Bki — ^/d\k){l\}\ 

Xij,kl = {Pj)ij,M ■ 

So, although x and pj act on different vector spaces, the spaces and the matrices themselves are isomorphic. Thus, 
X and Pj are merely different aspects of a single Choi- Jamiolkowski representation. We will use x for both in the 
remainder of this paper. 
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B. The Choi-Jamiolkowski isomorphism; partial traces, partial transposes, and no- 
tation 

The power and utility of the Choi-Jamiolkowski representation are due in large part to the fact that it establishes 
an isomorphism between (on the one hand) completely positive maps on i('H), and (on the other hand) a certain 
class of bipartite states on 1-1®%. The states in question are those where the state of the second, ancillary system 
{p2) is maximally mixed, for the Jamiolkowski state is produced by applying £ ® 1 to a, maximally entangled state. 
Since p2 is initially maximally mixed, and the map only acts on the first system, the Jamiolkowski state must also 
have p2 — 1/d. 

We will make extensive use of the Jamiolkowski representation. So a channel £ will be represented by a matrix x, 
which generally is interpreted as a bipartite density matrix. Two important operations on such a state are: 

• The partial trace over one subsystem. For any bipartite matrix we will denote the partial trace over subsystem 
i by tri[^]. When the partial trace is applied to a state, it generates the reduced density matrix of the remaining 
subsystem: p2 = trip. 

• The partial transpose over one subsystem. For any bipartite matrix we will denote the partial transpose of 
A with respect to the zth subsystem by A^^ . (Similarly, indicate the full transpose of A, with respect to 
both subsystems). Partial transposition is not a completely positive operation; in particular, it transforms many 
entangled states into negative matrices. Interestingly, it appears naturally in our result. An explicit formula for 
the partial transpose is given by: 

A^^ = J2 {\k){l\<E}l)A{\k){l\<»l) 

kd=0 
d-1 

A^- = {l<E)\k){l\)A{l<»\k){l\). 

k,l=0 

We will also use the Choi-Jamiolkowski isomorphism occasionally. Its primary function is to associate bipartite 
states with superoperators, but the isomorphism extends to other objects in the relevant vector spaces - in particular, 
bases. We will find it essential for some calculations to write a channel's ^-matrix in specific bases: this will either 
be a basis for L{7i), or a basis for H (E)'H. The isomorphism is as follows: If {Pk} is a basis for L{7i), then define 
\ipk) = -Pfe <8) 1 1^*), and is the corresponding basis for 'H®'H. Choosing a basis in this way ensures that x is 

entry-wise identical in the two representations. 



C. Generalized Pauli and Bell bases 

For qubits {d ~ 2), the Pauli operators are an exceptionally convenient basis for L{'H). The corresponding basis 
of bipartite states is the Bell basis. In higher dimensions, it's generally not possible to pick a basis with all the nice 
properties of the Pauli operators, but we can generalize most of them. 

When necessary in our calculations, we will make use of a hermitian, orthogonal, supernormalized basis of matrices 
{Pa] satisfying the following conditions: 

Po = 1. 

In dimension d — 2, the ubiquitous Pauli operators form such a basis; in higher dimensions, the generalized Gell-Mann 
operators |j| satisfy the conditions. We require only the listed conditions - most importantly, the fact that Pq ^ 1 
and therefore every other P^. is traceless, which singles out the xo,o matrix element as a unitary invariant of f - so we 
will not specify a particular basis. The corresponding basis of bipartite states is orthonormal, and because Pq = 1, 
we have l-^o) = |^)- We will refer to this basis as the "generalized Bell basis," though it does not by any means 
generalize all the properties of the Bell states. 

Finally, we will make extensive use of the bipartite projector 

Xo = 1^0X^0 1 = |*)(*|. 

It is the X representation of the identity channel 5 = 1, which motivates the notation xo- Moreover, it enables us 
to write basis-specific expressions in a basis-invariant way - e.g., the unitary invariant xo,o mentioned above can be 
written in a basis-invariant way as 

Xo,o = (V'ol X IV^o) = tr [xXo] • 

D. Gate Fidelity 

Suppose that U is a unitary quantum operation (i.e., U{p) = UpU^ for some unitary U), and £ is a noisy imple- 
mentation of U. Then the gate fidelity between £ and U, given state p, is 
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:f£,u (p) = [tv^ VWPip)VW) 



It is simply the state fidelity between £ (p) and U (p) . defined in general as 



F{p,a) = itr^ ^a^j 



Fidelity measures indistinguishability: F = 1 means the states are identical, while F = implies that a single measure- 
ment can distinguish them perfectly. Thus, the gate fidelity F£,u{p) is a convenient measure of how distinguishable 
the actions of lA and £ are - on the state p. 
If the input state is pure (so p ~ then 

If {Kk\ are Kraus operators for £, then we can rewrite this using the cyclic property of the trace as 



J^S,u{<P) = tr 



[/|0)(0|C/t^/v,|</,)(0|4 



tr [m^j^i o mm)] = tr mmmm 



where A ~ o £ represents how much £ deviates from U. 

We would like a performance measure that removes the state dependence - an invariant property of the gate fidelity's 
distribution. If we focus our attention on pure input states, then this distribution is well defined, for the set of pure 
quantum states, CP"^"^, admits a unique (and natural) invariant distribution. It is the Fubini-Study (FS) measure 5| 
which we will denote by fips- This is the Borel measure induced by the Fubini-Study metric on CP^~^, and also the 
unitarily invariant Haar measure on CP"^^^. The average fidelity under fips has been derived previously (see, e.g., 
Rcfs. y, lZ|), and we will derive it again in Section fllll as a warmup for computing higher moments of the gate fidelity 
distribution. 



E. Permutation Operators and the Symmetric Subspace 

We are trying to compute averages over a unitarily invariant measure, and therefore over the unitary group itself. 
To do so, we will begin by transforming polynomial functions of degree k into linear functions on k copies of the 
Hilbert space in question. We will then rely on a simple and beautiful result called Schur- Weyl duality, which states 
(in essence) that the actions of the unitary group and the permutation group (on such a fc-fold tensor product) 
commute, and their irreducible representations (irreps) share a set of labels. Rather than discuss Schur- Weyl duality 
in detail, we will only introduce the tools that we need. In this section, we will briefly discuss permutation operators, 
the symmetric group, the totally symmetric subspace of T-L®^ , and a couple of technical results that will be useful 
later. 



Let 7i he a Hilbert space and 'H'^'^ a tensor product of k copies of it. If Sk is the symmetric group on k objects 
and (T G S'fc is a permutation, then there exists a unitary operator Va that implements cr on Ti.®^: 

Va (IV-i) ® - ® IV'fc)) = ® - ® \^<r-Hk)) ■ 

The totally symmetric subspace of 7^®'^ comprises all the states that are invariant under every such permutation 
operator - or, to put it another way, it is the intersection of the +1 eigenspaces of all Va- The projector onto this 
space is given by 



8|). Suppose we 



fc! 



This projector appears in integrals over the unitary group, for the following reason (see Ref. 
take a state G H, and then construct the projector onto its fc-fold tensor product, This projector is a 

+1 eigenoperator of every permutation, so it lies in the totally symmetric subspace. Now, if we take the average of 



all such projectors according to the unitarily invariant measure (denoted \ip){'4'\^'^) , then we get an operator in 
L ('H®'"') that: (i) is invariant under all unitaries (ii) is supported on the totally symmetric subspace; and (iii) 

has unit trace. By Schur's Lemma, a unitarily invariant operator is a weighted sum of projectors onto irreducible 
representations of the unitary group. The only such operators supported on the totally symmetric subspace are 



proportional to TTgym itself. Since | ?/>)(■!/> j®*^ has unit trace, 

I^X^I^'^d,... = "-^-^^l^j.,. . (2) 
^acpd-i tr [7rsy„i(fc,c(jJ 

The normalization constant is easy to evaluate by counting arguments. The symmetric subspace of T-i®^ is spanned 
by the bosonic Fock states, jni, 712, • . . , JT-d), which are indexed by the number of particles Ui in state i, subject to 
rii — k. Counting such states, we get 

(k + d-\\ d{d+\){d + 2)...{d + k-l) 
tr[7rsym(fc,d)J = ( 1= . (3) 

Suppose that we have k operators Ai, in L (H), and a permutation a G Sk written as a product of disjoint 
cycles {ai...ar) ... {aq...ak)- Then 

tr[(Ai® Va] ^ ti-[Aa,...Aa^] ...tr [Aa^...Aa,] ■ (4) 

So, to calculate tr [{Ai (E) ... (E) Ak) Pa], we can write cr in cyclic notation, replace "i" with operator Ai, and replaces 
each "( )" with "tr[ ]". 



III. CALCULATING THE VARIANCE OF THE GATE FIDELITY 

We can use these tools to rederive the average value of Fs,u (hereafter denoted simply by F), and also to calculate 
its variance: 

Var(J') = J2 - J^ 

Our ultimate goal is a basis- independent expression (Eq. pT|). entirely in terms of the x matrix for h. = U'^ o 8. 



A. Average Fidelity 

To determine Var(J^), we need to calculate both F and F"^. Fortunately, the tools in the previous section can be 
used calculate any moment of F, although the calculation of J^" gets rapidly harder with increasing n. So we begin 
with F^ which is already well-known [7[, as a sort of warmup. 

We begin by expanding the state-dependent gate fidelity in terms of A's Kraus operators {Ki}, 

i 

i 

This expression is a Hilbert-Schmidt inner product between (i) a term including all the Kraus operators, and (ii) a 



term including all the |'!/))-dependence. To average over ?/), we need only average the second term, using Eq. 



F ^ F 



,t\ 7rsym(2,(i) 



tr (TTsy m 



(5) 



We now expand 7rsyi„(2,d) as a sum of permutation operators, invoke Eq. (U) to evaluate the traces, and use Eq. ^ 
to evaluate the normalization: 



F 



2tr[ 



^.vn.(2,d)l E E *4 



E.(tr[/^.]tr 

'symv2,d)J ^ ^^^^ LV /J 

If we write F in terms of the x matrix, as A(p) = J2i m Xi,mPipPrn, then the same calculation yields 
2 



F = 



(2, d)] = ^i^rn (tr [Pi] tr [P„,] + tr [P,P, 



d+l ' 

which agrees with the results from Refs. 6|, lZ[. We can rewrite this in a basis-invariant fashion by recalling that 



Xo,o = tr [xxo], so 



— ^ tr [xxo] d+l 
d+l 



(6) 
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We observe that tr [xxo] represents the overlap of A with the identity channel, and therefore how mueh A leaves the 
input state unchanged. It is also a unitary invariant of A; xo,o does not change if we rotate A by a unitary channel 
U, mapping A — )- U^^ o KoU. 



B. Variance of the Fidelity 

Now, let's tackle the calculation of F"^. As done previously, we expand in terms of A's Kraus operators, 

TWi^m) = tr [Ad^xv'i) • ivxv-i]' = Y.ir[K,mmAKi\m\\Y.^AKMmMK]\ 



tr \[k, K} ® Kj ® Xj) • 



(7) 



and then use Eq. ^ to simplify the average, F"^ , as 

Finally, we write 7rsym(2, d) as a sum of permutation operators, invoke Eq. (j4]) to evaluate the traces, and use Eq. ([3]) 
to evaluate the normalization: 



d{d+l){d + 2){d + i) 







tr [Kj\ tr 


H 


+ tr 


'k,k] 


tr 










d{d + l){d 


+ 2)(d 


+ 3) 



(8) 

There are 24 products of traces in the sum, each corresponding to one of the 4! permutations of 4 objects, so the 
ellipsis in the last equation represents 22 more terms. 

In this case, it's more productive to use the matrix basis {Pi\ and write F using the x matrix. The same calculation 
then yields 

(tr [Pi] tr \PJ\ tr [P„] tr [P,] + tr [P,P,] tr [P„] tr [P„] + ...) 



d{d+l){d + 2){d + ?,) 



(9) 



By writing out all 24 terms in the summation (excluded here for reasons of space and extreme tediousness), we can 
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use the properties of the {Pi} basis - e.g., tiPiPj = dSij, and trP^ = dSi^ - to simphfy this expression to 



J-2 



d{d+l){d + 2){d + 3) 



( dHr\xx^f 

+ d\Y [xo (2x2 + + X^X + 2x)] 
+ d2 (4tr[xXo] + tr [xx^] + tr [x'] + l) 
+ rf(2Eztr[(xi,o + Xo,/)fiA(l)]+3) 



V 



2tr 



E; m Xl.rnPlHPrr 



+ tr 



(A(l)r 



(10) 



All but three of the terms in Eq. ([10)) are expressed solely in terms of A's x-matrix. The exceptions are: 



• tr 



(A(l)) comes from 4-cycle permutations like a = (1234), which produce terms of the form 



E/mnr XLrnXn.r^^iPlPmPjiPr]- 

2dtr {xlo + Xod) Pi^ (1)] comes from 3-cyele permutations like a — (123)(4), which produce terms of the 

form J2hnnr Xi,mXn ,rtr [P;P„Pr]tr [F„i] . 



• 2tr 



J2i mXi,mPi^{Pm) comcs from 4-cycle permutations like a = (1324), which produce terms of the form 



S/mnr X^mXn,rtr[P;P,iPmPr] ■ 

Our next order of business is to rewrite these terms in a basis-invariant fashion, using x, Xo, the partial transpose, 
and the partial trace (see Section lll| . 

The first term is easy. It's straightforward to verify that 

AQ)=tr.x, 



tr 



{A{l)f = d^tr [(tr^x)'] 



(11) 



We can rewrite the second term using the non-Hermitian operator 

XXo = ® 1) ^" ® 1) Xo = 51 ® 1) ^0 = ^ ^ Xi,oPi\i){j\ 

and its adjoint xoX- Partial tracing over the second (ancillary) system yields 



ti"2(xXo) = \YxLoPh 

I 

tr2(xox) = \Yxa,iPu 
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which provides a basis-invariant expression for the second term: 



tr 



cftr [tT2 ixXo + Xax) tr2x] • 



(12) 



To rewrite the third exceptional term, we apply a few more tricks. First, we observe that for any bipartite operator 

tr \x^^ {A (g) B)] = -ti- [AA{B)] . 
This is easily shown from the definition of x- Next, we note that since Xo — 2 Si m X/mlO(™l ® its partial 



transpose (over either subsystem) is 



Ti T, 

Xo Xo 



i^|0(m|® |m> 



This bipartite operator is proportional to the unitary SWAP gate (which wc denote S), which maps |Z)(g)|m) — > |m)®|Z). 
Now, consider the operator S{Sx)'^^ , which can be written out as: 



\ ijlm 



Ti 



^s\Y,xi^p^\M\®Pi\m 



ijlm 



Irn 



Together, these two observations imply that 



tr 



{SiSxf^f'-] = ;^5^Ximtr[PzA(P,„)], 



(13) 



Im 



but tr [X^^y^^] = tr [Xy] (just as for the full transpose), so the two partial transposes cancel. Substituting in 
S — dxQ^ , we get a basis-independent expression for the third exceptional term: 



^X/,™tr[P,A(P„,)] = rf*tr [xXonXo^x)"^^ 
= dhr 



l.m 



Xo X] iXo X 



XXo I 



(14) 



Hence in total if 



as = tr(xxo)' + 2tr(^(xXo') {xXo') 

63 = 2tr (x'xo) + tr (xX^Xo) + tr (x^XXo) + 2tr (xxo) + 2tr (tra [xXo + Xox] tra [x]) , 

C3 = 4tr (xxo) + tr (xx^) + tr (x') + 1 + tr ((trax)') , 

ds = 3. 

then, 



d4 + Q(P + 11^2 + 6d ■ 

From Eq. (jG]) we have, 



—2 _ ad"^ + bd+l 
^ rf2 + 2rf + 1 

where, 



a = tr (xxo)^ , 
b = 2tr(xXo)- 

Taken together, Eq.'s ([T5|) and give the following expression for Var (J"), 



, 04^^ + 646?^ + C4d^ + did^ + Bid + /4 
" (d + l)3(d + 2)(d + 3) 



where. 
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a4 = 03 — a, 

= ^3 + 203 — 6 — 6a, 

C4 = 03 + 263 + C3 — 11a — 66 — 1, 

di = 63 + 2c3 + (i3 — 6a — 11& — 6 

64 = C3 + 2cJ3 — 11 — 66 

U = ^3-6. (18) 

IV. HIGHER ORDER MOMENTS 

We briefly discuss how to calculate both the higher order moments J^"' and central moments (J^ — i^)"' of the gate 
fidelity J-. We have already given a detailed analysis of the m = 1 and m — 2 cases, and have provided explicit 
expressions for T'^, and Var(J^) = J^^ _ jr^ jj^ terms of the Jamiolkowski state of a quantum operation (note 
that the first central moment is just T). The central moments contain valuable information about the distribution 
of the gate fidelity. The second central moment (variance) is a measure of the spread of the distribution, the third 
central moment measures the skewness, and so on. Since the m'th central moment is just (j-" — F)"^ and we have an 
expression for the expression for the m'th central moment is easily obtained if each of T'^ is known for fc = 1, m. 

For m e N, the m'th power of J^, J^™, has action on pure state 



(iv^)(^i) = tr (A(iV')(v^i) wmr = E ((^'^1 ® <) ® - Eti- ((^'™ ® <.) ® 

In an analogous method to that used in calculating an expression for the variance, we can use the results regarding 
permutation operators and the symmetric subspace described in Sec. Ill El to obtain 



jrr, 



tr {TTsym {2m, d)) 



E tr((x,,®< 



K,^ ® K}^ -K.y^ (2m, D) 



i;^^ E (^h) tr (<) ...tr (A^J tr (a^) + ... + tr (a',,< ...A-,„< 
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where again the {Ki} are a set of Kraus operators for A. There are (2to)! terms in the sum corresponding to the fact 
that there are (2?ti)! elements in the symmetric group S2m and we have used the fact that. 

tr(7rsym {2m, D)) = 

Expanding the Ki in terms of the basis {Pi\ with the previously discussed properties gives, 



2m + - 1 
d-l 



^ (2m)!f^"'+^-^) ^ Xn,,n,-X.„„.™, (tr(P,,Jtr(P,,J...tr(P,„Jtr(P,, 



■Til ''■"^2 

1 



V. THE SINGLE QUBIT AND LARGE-DIMENSIONAL CASES 

In this section we analyze the behavior of Var (J^) in two useful cases, that of a single qubit (d=2) and as d grows 
to oo. The calculations in both cases are straightforward but tedious and so are contained in the appendix. We first 
look at the case of a single qubit. 

For a qubit system, one can obtain much simpler equations for Var {J-) than Eq. ()17p . The calculation involves 
starting from Eq. grouping certain terms together, and considering various cases. The result of the calculation is 
that the second moment of J- is given by. 



^2 ^ 



- 48tr (xxo)' + 64tr (xxo) + 24tr (xx^xo + X^XXo) + 32tr (x^Xo) + 4tr (xx^) + 12tr (x^) + 4tr ((trax)') + 6 



120 

Using Eq. (fT6| we obtain the following particularly simple analogue of Eq. (fT7|) . 



Var (^) = -^ + ^tr (xXo) - ftr (xXo)' + ^tr (x^Xo) + ^tr (x^) 

+ Jtr (xx^Xo + X^XXo) + ^ (tr (xX^) + tr ((trax)')) ■ (19) 

In the case that d — > oo, it is relatively straightforward to deduce the behavior of Var(J^). The idea is to use a 
suitable expression for the variance and bound the coefficients of the powers of d. The result is that for large d. 
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1 



0(3). (20) 



Therefore Var (J-) — > in the inverse of the dimension of the quantum system. A key point here is that Eq. (j20p is 
completely general: for any quantum operation £ and any unitary operation U, the gate fidelity between £ and U 
has variance that diminishes as O (^). 

VI. DISCUSSION 

We have given a method for calculating all moments of the gate fidelity J-£^u between a unitary U and a quantum 
operation £. Using this method we have obtained a closed form, basis-independent, expression for Var {J-£sa) in terms 
of the Jamiolkowski state for K = o £ . A simple expression for the variance is given in the single qubit case and 
we have shown that for large quantum systems the variance scales as O (^) for any £ and lA. 



There is growing interest in the use of twirling 



9|, |10[ and randomization methods 



Uj to estimate partial 



information about the unknown noise affecting the implementation of quantum memory or quantum gates in a 
completely scalable manner |l^ [isj . In addition to the eigenvalues of the twirled noisy operation (which includes the 
average gate fidelity as a special case), it is hoped that other information such as the variance of the fidelity over 
the twirling/randomizing gate set may provide useful information about the unknown noise model. Indeed in |ll| it 
is suggested that the variance of the fidelity measured under the proposed randomized benchmarking protocol may 
provide useful information about the extent to which the noise is coherent. Our work shows that there are difficulties 
with analyzing the variance of the fidelity in this context. 

First, we observe that the variance decreases exponentially with the number of qubits that are involved in the 
randomized benchmarking or twirling protocol. This implies that an exponentially increasing number of repetitions of 
the protocol would be required to obtain a reliable estimate of the variance. Secondly, we remark that in order for the 
variance to be independent of the initial state and the particular choice of randomizing gates (and hence reflect some 
intrinsic feature of the noise model), the randomizing gates must comprise (or at least generate) a unitary 4-design. 



10[ for further discussion), randomizing under 



Since the Clifford group is only a unitary 2-design (see in particular 
different choices of Clifford gate sets can produce different values for the variance. However, the original randomized 
benchmarking protocol considered in [6|, which suggested using Haar-random gates, will produce a variance that 



17 

depends only on the noise model (assuming, as usual, that the noise is effectively independent of the sequence of 
randomizing gates). While that protocol may be practical for single qubits, or a small numbers of qubits, the fact 
that exponential complexity is required to implement a Haar-random unitary makes the protocol of J| impractical 
for large numbers of qubits. 

Lastly, even under a benchmarking protocol for a single qubit that makes use of a gate set that generates a 4-design, 
our expression shows that the variance depends in a non-trivial way on both the diagonal and off-diagonal elements of 
the x-niatrix. Hence the extent of the coherence of the noise model, understood here as referring to the fact that the 
noise can not be expressed as a Pauli channel, can not be inferred from an estimate of the variance alone. However, 
there remains the possibility that the extent of coherence in the noise could be estimated by comparing results from 
different randomized benchmarking schemes, eg. with and without supplementary Pauli rotations. This would be a 
worthwhile topic for further investigation. 

As a final comment, another application of our results is in the context of simulating quantum systems on a 
quantum computer. This one of the most important potential applications of quantum information processing, and 
the most likely to be possible in the near term. Of course an important shortcoming of efficient quantum simulation 
(relative to inefficient simulation on a classical computer) is that not all the information about the simulated system 
is available upon measurement. This "readout problem" poses a practical obstacle and raises the question of what, if 
any, properties of the system may be estimated with a scalable number of repetitions of the simulation. 

In the context of studying quantum chaos, it was suggested in IJ] that the characteristics of fidelity decay under 



perturbation, an important indicator of quantum chaos, could be estimated in an efficient manner. In particular, 
under the random matrix conjecture for complex and chaotic systems, the fidelity decay can be predicted exactly 
under any known perturbation, and compared to the observed decay. An implicit assumption of that argument is 
that the variance of the fidelity remains small as the system dimension increases so that a reliable estimate of the 
mean is possible with a scalable number of repetitions. Our result on bounding the variance shows that this is indeed 
the case and gives a rigorous justification to that work. 
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A. VARIANCE FOR A SINGLE QUBIT 

In this section Var(J^) is calculated in a more compact form for the case of a single qubit. Since we already have 
a simple expression for J- given by Eq. (jS]) we only need to calculate J-'^ . We will use Eq. Q which will allow us to 
group particular terms together to obtain a more simple expression. 

To begin, we recall some properties of x- First, x is positive and has trace equal to 1. Second, raXi^mPiPm = 
A(l) = (iA(-^), and third, J2i m^i,"^^^nPi = 1 from trace preservation. The 24 terms in Eq. ([9]) are sorted into 
groups of 3 each of which is dealt with separately. Since we are working with a single qubit, d = 2 in all expressions 
below. Note that many of the expressions below only hold under the assumption that d = 2. 

1. First Group of Terms 

The first group consists of the following 10 terms: 

Xl,mXn.rm] [Pm] [Pn] [Pr] + [PlPr] [Pm] [Pa] + [Prr.Pr] [Pi] [Pa] + [Pi] [Prn] [PnPr] + [PlPn] [Prr.] [Pr] 

l,m,n,r 

+ [PlPn][PraPr] + [Pr,. Pn][Pl][Pr] + [PmPn][PlPr] + [Pi Pm][Pn][Pr] + [Pi Pm][Pn Pr]) 

where for ease of presentation we have used square brackets "[ ]" to represent the trace operation. Using the assumed 
properties of the {Pi} basis this group can be written as, 

Xo,iXifi + Xa.o 

1 + 8 f E (^0,' + ^lo) 1 + 4 ( E i^lm + Xl,mXra,l) + 1 ) • (Al) 



2. Second Group of Terms 

The second group consists of the 8 terms 



E Xi,raXnA^<PiPrPnMPm) + tr(P,P„P,)tr(P„0 + tr(P,„P,P„)tr(PO + tr(P,„P„P,)tr(PO 

+tr(P,P,„P;)tr(P„) + tr(P,PzP„Otr(P„) + tr(P„P,„POtr(P,) + tr(P„P;P,„)tr(P,)). 

These 8 terms arc grouped by two and the resulting four sums are calculated independently. For the first sum we 
deal with five cases: 
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Case 1: n ^ r, n ^Q. and r 7^ 0. This implies PnPr = —PrPn and so the above is 0. 
Case 2: n = r. We get 2 Y.i,m.n Xi.mXn.ntr (-P/) tr {Pm) which equals 2xo,orf^- 
Case 3: n = 0. We get, 2 1]!,m,r Xi,mXo,rtr (P/Pr) tr (P,„) which is just 2 Y,i Xi,oXo,id'^- 
Case 4: r = 0. Similarly to case 3 we get 2 J^i xfo^'^- 

Case 5: r — and rt = 0. This case is required because we have over-counted for this case twice above. The result is 
Hence the five cases give in total, 

J2 Xi,mXn,r(tr (P/P,.P„) tr (P„J + tr (PzP„P,) tr (P„0) - 2x0,0^' + 2 ^ XifiXo,id^ + 2 ^ x'o'^' " 4x^,oC«'- 
The other three sums are calculated in a similar fashion and in total, 

8xo,od' + 8 J2 XL0X0,id^ xlid^ + 4 ^ xl^d^ - l&xlfid\ 

I I I 

Substituting d ~ 2 and collecting terms for both the first and second group of terms gives, 

- 48xo,o + 48X1,0 + 32 ^ xli + 16 ^ x'o + 16 Xi,oXo,i + 4 x'™ + 4 ^ (A2) 

3. Third Group of Terms 

Lastly we have the 6 terms 

J2 Xi,mXn,r-(tr {PlPrPnPm) + tr {PlPnP,nPr) + tr {PlP^PrPm) + tl {PlPrPM 
+tr (PiPr^PrPn) + tr iPiPmPnPr)) 

which we group into three pairs as, 

Xl,mXn,r (tr {PPrPuPn) + tr {PlPrPnPn)) 

l,7n,n,r 
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Xi,mXn,' (tr [PlPnPmPr) + tr {PlP,nPnPr)) 

and 

J2 Xi,mX„,r(tr [PiP^PrPn) + tr [PlP^PrPrn)) 

l,m,n,r 

The first pair is easy to calculate using the same cases as above for m and n. The result is, 

X/,mXn,r(tr {PlPrPnP,n) + tv {PiPrPraPn)) = 4 ^ + 8X0,0 "8^X,,oXo,/- 

The second pair requires a bit more effort and wc go through the cases separately, 

Case 1: m ^ n, m ^ and n ^ 0. This case gives 0. 

Case 2: m = n. In this case the pair becomes ^ Xi-mXm.i- 

Case 3: to = 0. The pair becomes 2 J^i n r X/,iXii,rtr (PiPnPr) and after a direct calculation, 

4(xo,o + Xi,o(xo,i + Xl,0 + iX2,3 - «X3,2) + X2,o(XO,2 + X2,0 - «Xl,3 + «X3,l) + X3,o(xo,3 + X3,0 + iXi,2 - iX2,l))- 
Case 4: n = 0. Similar to case 3 we obtain, 

4(X0,0 + XO,l(XO,l + Xl,0 + iX2,3 ~ «X3,2) + X0,2(X0,2 + X2,0 - «Xl,3 + «X3,l) + X0,3(X0,3 + X3,0 + «Xl,2 - «X2,l))- 

Case 5: TO = and n = 0. This case gives Xi,oXo.i- 
Combining the 5 cases gives, 

Y X/,™Xn,r(tr {PlPnPraPr) + tr {PlPr^PnPr)) = 4 ^ 

/ + 8X0,0 

+4 (xo,i + Xi,o) (X0,1 + Xl,0 + i (X2,3 - X3,2)) + 4 (xo,2 + X2,o) (X0,2 + X2,0 + « (X3a - Xl,3)) 

+4 (xo,3 + X3,o) (xo,3 + X3,o + i (Xi,2 - X2,i)) ~ 8 X! Xo,/X;,o 

I 
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The third pair can be expressed as, 

X! X^mX«.r(tr {PlPraPrPn) + tr {PlPnPrPm)) = tr (A^ (A^ (1))) + tr (A (A (1))) = 4 

and so combining the three pairs gives, 

8 ^ Xi,7nXm,i + 16xo,o - 16 ^ Xi,oXo,i +4 + 4 (xoa + Xi.o) (xo,i + Xi.o + « (X2,3 - X3,2)) 

+4 (X0,2 + X2,o) (X0,2 + X2,0 + i (X3,l - Xl,3)) + 4 (xo,3 + X3,o) (X0,3 + X3,0 + i (Xl,2 - X2,l)) ■ 

We can calculate another expression for the three pairs by noting that four of the terms give, 

tr (A (At (1))) + tr (A^ (A (1))) + tr (A (A (1))) + tr (At (At (1))) = 3d + tr (At (A (1))) = 6 + tr (a (1)") 

with the remaining two terms, J2i,m,n,rXLmXn,rtr (PiPnPmPr) and J2i,m,n,rXLmXnAT (PiPrPmPn), being complex 
conjugates of one another. From the calculation of the first pair given above, 

J2 X/,mX«,r(tr (P/P^P„P„,) + tr(P/P^ P,„P„)) = 4^Xi,mXm,/ + 8xi,i - 8^Xi,lXl,/, 

l,m,n,r l,m I 

and since Y.l,m,n,r Xl,7nXn,rtr {PlPrPnPm) = 2, 

^ X/,mXn,rtr (PlPrPmPn) = ^ X/,mXn,rtr {PlPrPmPn) = 4 ^ Xl,mXm,l + 8xo,0 ^ 8 ^ X/,OXO,; - 2. 

Therefore the three pairs can also be written as 

2 + tr (^A (1)') + 8 ^ Xi,n.Xm,i + 16X0,0 - 16 Xi,oXo,i, (A3) 

l,m I 

where by Eq. pT|). 



tr (a (1)^) = 4tr ((trax)^) =2 + 4 (xo,i + Xi.o) (xo,i + Xi.o + i (X2,3 - X3,2)) 
+4 (xo,2 + X2,o) (X0,2 + X2,0 + i (X3,l - Xl,3)) + 4 (xo,3 + X3,o) (X0,3 + X3,0 + i (Xl,2 - X2,l)) 
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Combining all 24 terms given in Eq.'s (KJ) . (|X2|) and (KM . and noting tr (7rsym(4, d)) = 

( - 48xL + 64X0,0 + 24 (xx^ + x^x)o o + 32 (x')o o + 4tr (xx^) + 12tr (x^) + 6 + 4tr ((trzx)') ] 
m 

B. VARIANCE IN LARGE DIMENSIONS 

To deduce the asymptotic behavior of Var (J-) wc use the expression for J-^ given in Eq. (jlOp . From this equation 
one can obtain the following expresssion for Var(J^), 



rd!^ + s(f + ucP + vd + w ,^ , 

^'^'^^^ = dWTMTi)WTuTiy ^^'^ 

where, 



r = -4Xo,o+(xX^)o,o+(x^x)o,o + 2(x')o,o' 

s = "6Xo,o + (xX^)o,o + (^^^)o,o + (X^^) - 4X0,0 + tr (x") + 2 (x')^ „ , 
u = -8x0,0 + tr (xx^) + tr (x') + 2tr (xz,o + Xo,i) Pi^ (1)^ - 1, 

« = 2tr {Y.XUnPiMP.n)^ +2tr|^^(Xi,o + Xo,/)P/A(l)^ +tr((A(l))') -3, 
w = 2tr I ^ x/,™^iA(P,„) +tr((A(l)f 

\l,m J 

The denominator of (|Eip is a quintic polynomial in d. The numerator contains powers of d up to and including d^, 
however the coefficients depend on x- We would like to bound these coefficients in terms of d. 

First, since x is a trace-1 positive semi-definite matrix, we obtain the bounds < Xo o — Xo.o < 1 and < tr (x^) < 
tr (x) = 1- Next, for a linear operator A, the Frobenius (Hilbert-Schmidt) norm of A, denoted by || \\p^ is given by 



\A\\f = Y^tr (j4tA). Using the Cauehy-Schwarz inequality we obtain. 



tr(xX^)|<||xil^||x^ 
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Since x ^nd X"^ have the same singular values, ||x||_f = ||x"^|1_f- Therefore |1x|1_f < 1 ItrfxX"^)! < 1- This also 



implies 



Ux^)o.o 



Hence < tr 



((A(l))') 



< 1 and 



< 1. To deal with A (1), we note that it has trace d and is positive semi-definite. 



The only two coefficients that remain to be bounded are tr (x/,o + Xo,;) Pi^ (1)) and tr \ Xi,mPi^{Pm) 



By the Cauchy-Schwarz inequality, 



tr^X/.oP;A(l) 



< 



^XifiPi 



l|A(l)||^<d 



^XifiPi 



Since, 



^XifiPi 



\ 



\ 



tr 



d{x'),,,<d 



we get |tr (^^ (xi,o + Xo.z) P/A (1))| < 2^2. 

Finally, we need to bound tr {^i^ XL.mPiHPrn)j ■ Using Eq. we have tr (^J2i,m XUmPi^Pn. 

d^tr i^S {Sxi^^ xj where S is the unitary Kraus operator for the SWAP gate. Again, by the Cauchy-Schwarz in- 
equality, 



tr (x^ (^x)"^^) I < llx^lli. II (^X)''^ 1^ = ^tr ((x^) (x^)^) ^tr (^((5xf ^) ^^x) 

<^tr(^((5xf^)^5xf^) 



since ^tr((x^)(x5)^) = ^17^ = ||x||^ < f. For any A, B e L ® H), {A^f' = (A^^)^ and tr ((AS) 
tr (B^iA"^!). Therefore, 



Ti 



tr 



{{Sxf')\sxf') = tr ( ((^x)^)"^' (^X)''M = tr ((^x)''^ ((%)^) 



Ti 



tr(^((5x)U^x))'^') -tr(x2) <f, 



which implies 



tr {El,^Xl,mPlA{Pm)) 
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Combining all of these results implies that the numerator of (|B1I) is quartic in d and so for large d 



Var(^) 



0{d^) 
1 



0[-\. (B2) 
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